ABSTRACT

Regulation of messenger ribonucleic acid (mRNA) 
subcellular localization, stability and translation is 
a central aspect of gene expression. Much of this 
control is mediated via recognition of mRNA 3'
untranslated regions (UTRs) by microRNAs (miRNAs)
and RNA-binding proteins. The gold standard ap- 
proach to assess the regulation imparted by a tran- 
script's 3' UTR is to fuse the UTR to a reporter cod- 
ing sequence and assess the relative expression of 
this reporter as compared to a control. Yet, transient 
transfection approaches or the use of highly active 
viral promoter elements may overwhelm a cell's post- 
transcriptional regulatory machinery in this context. 
To circumvent this issue, we have developed and 
validated a novel, scalable piggyBac-based vector 
for analysis of 3' UTR-mediated regulation in vitro
and in vivo. The vector delivers three independent 
transcription units to the target genome — a selec- 
tion cassette, a turboGFP control reporter and an 
experimental reporter expressed under the control 
of a 3' UTR of interest. The pBUTR (piggyBac-based
3' UnTranslated Region reporter) vector performs
robustly as a siRNA/miRNA sensor, in established in
vitro models of post-transcriptional regulation, and 
in both arrayed and pooled screening approaches. 
The vector is robustly expressed as a transgene dur- 
ing murine embryogenesis, highlighting its potential 
usefulness for revealing post-transcriptional regula- 
tion in an in vivo setting. 



INTRODUCTION 

Coordinated regulation of gene expression is fundamen- 
tally important for cellular division, differentiation and re- 
sponse to environmental cues. While the field of proteomics 
is rapidly advancing, the most broadly utilized practice in 
assessing coordinated regulation of gene expression is via 
analysis of messenger ribonucleic acid (mRNA) steady-state
expression using either microarray (1) or next-generation
sequencing approaches (2-4). Both of these approaches
are exceptionally powerful, providing the ability
to simultaneously monitor increases and decreases of the in- 
dividual gene products comprising the transcriptome. Yet, 
the information provided by either approach is limited in 
that it does not effectively reveal whether a given mRNA, 
irrespective of relative representation, is being actively used 
by the cell's translational machinery. 

Indeed, paired transcriptomic and proteomic analyses
have revealed varying but significant degrees of discordance 
between the relative expression of a given mRNA species 
and the protein encoded by these transcripts (5-9). The re- 
sults of these studies are generally consistent with the de- 
scribed role of post-transcriptional regulation of gene ex- 
pression in virtually every physiological context (10-13). 
Interestingly however, a recent study (9) suggested that 
mRNA translation rates are the major (~55%) contributor 
to protein expression in murine fibroblasts. While mRNA 
degradation rates were found to contribute to protein ex- 
pression levels to a much lower extent in this study (~5%), 
the results are consistent with a model where the aggregate 
regulation of mRNA stability and translation plays a signif- 
icant, if not dominant, role in protein expression. 

Much of the control of gene expression at the mRNA
level is thought to be conferred via cis-regulatory elements
in the non-coding 3' untranslated region (UTR) of a given
mRNA (11,14,15). In various physiological states, these
cis-regulatory elements may be recognized by microRNAs
(miRNAs) or RNA-binding proteins dictating transcript
localization, stability and/or translation. In some systems,
3' UTR identity is largely sufficient to confer correct temporospatial
gene expression in vivo (14). Recently, much interest
has been accorded to both alternative splicing (16)
and alternative cleavage and polyadenylation (17), both of
which may alter 3' UTR identity and thus visibility of related
gene products to the post-transcriptional regulatory
machinery. Given that mutations within the UTRs of cer- 
tain genes can significantly impact human health, for ex- 
ample muscular dystrophy and schizophrenia (18-20), it is 
of great interest to determine if and how genomic varia- 
tions within the 3' UTR, uncovered via genome-wide as- 
sociation studies and other high-throughput screening ap- 
proaches, impact the pathology of the disease or phenotype 
with which they are associated. To this end, a scalable and 
robust reporter system for modeling these variations is de- 
sirable. 

The most common strategy for studying the impact of 3' 
UTR identity on protein expression is to fuse a 3' UTR of 
interest to a non-native reporter gene. Comparison of the 
relative expression of this reporter to a second (control) re- 
porter can then isolate differences in post-transcriptional 
regulation between the two reporters. Transient transfection 
approaches are widely used for this type of study, but are 
inherently unsuited for analysis over time (21) and may ob- 
scure or under-represent endogenous regulation due to sat- 
uration of the cell's regulatory machinery. The latter con- 
cern must also be taken into consideration when transient 
transduction of cells with a viral vector (e.g. adenovirus) is 
employed. While utilization of a single-copy genomic inte- 
grant circumvents many of these issues, the use of retro- or 
lentiviral vector systems for monitoring 3' UTR-based regulation
also suffers from several drawbacks. For example,
it is difficult to include a completely distinct control
reporter; the compulsory inclusion of irrelevant vector-derived
sequence and the potential presence of commonly used stability
elements [e.g. the woodchuck hepatitis virus post-transcriptional
regulatory element (WPRE)] (22) can confound
native post-transcriptional control; and retro- and lentiviral
long terminal repeat elements may be silenced over time by
the cell both in vitro and in vivo (23-27).

To circumvent some of these limitations and facilitate 
higher-throughput analysis of 3' UTR-mediated control of 
gene expression in vitro and in vivo, we have engineered 
and validated a novel, scalable piggyBac transposon-based
reporter system, pBUTR (piggyBac-based 3' UnTranslated
Region reporter). Originally isolated from the genome of
the cabbage looper moth Trichoplusia ni (28), the piggyBac
transposon has a large cargo size (29), is highly active
in many cell types (30,31) and has been shown to effect
long-term expression in mammalian cells in vivo (32).
The pBUTR vector system is comprised of three indepen- 
dent transcription units — a G418 selection cassette, a con- 
trol turbo-green fluorescent protein (tGFP) reporter cas- 
sette and a Gateway® (33) recombineering cassette under 
the control of the Ubiquitin C (UbC) promoter. Upon recombination
of turboRFP (tRFP), a 3' UTR of interest,
and a barcoded minimal polyadenylation site into this cassette,
a bi-fluorescent reporter vector is produced that can
be employed in both in vitro and in vivo model systems. Here
we assess the performance of the pBUTR vector/reporter
in the context of synthetic RNA interference (RNAi)-based 
siRNA/miRNA sensor activity, established models of post- 
transcriptional regulation by miRNAs and RNA binding 
proteins, arrayed and pooled screening approaches and in 
the context of murine embryogenesis. The reporter per- 
forms robustly in each of the scenarios tested and has the 
potential to be a valuable tool for prospective characteriza- 
tion of the impact of 3' UTR identity on gene regulation 
and function in both cell-based and animal studies. 

MATERIALS AND METHODS 

pBUTR vector construction 

The piggyBac (pB) transposon, pTpB, a generous gift
from Dr Matthew H. Wilson (34), was used as the backbone
of the destination vector. The bovine growth hormone
(Bgh) polyadenylation site was amplified from
pUbC-KBPA-iFGFR1-F2A-Luc2-E (a kind gift of Dr Jeffrey M.
Rosen, hereafter referred to as pUbC) using oligos containing
XmaI and AgeI sites and subcloned into the AgeI site
(located at the 3' end of the G418 cassette) of the pTpB vector.
The 5'-attR1-flanked CmR/ccdB cassette was amplified
from pDEST17 (Life Technologies, Carlsbad, CA, USA)
with oligonucleotides containing PstI and EcoRI sites and
subcloned into the PstI and EcoRI sites of pUbC. A synthetically
generated attR5 was subcloned into the EcoRI
and XhoI sites of the modified pUbC. The complete
attR1-CmR/ccdB-attR5 cassette was subsequently subcloned from
the modified pUbC into the pTpB transposon using SacII
sites. The Pgk promoter was amplified from the pL45 vector
and inserted into pTurboGFP with NdeI and EcoRI.
The chimeric intron from Rr-Luc-6xCXCR4 was amplified
and subcloned into the EcoRI and AgeI sites of pTurboGFP.
Finally, the Pgk-chimeric intron-turboGFP-SV40
polyadenylation cassette was amplified from the modified
pTurboGFP plasmid and subcloned into the SacII and BamHI
sites of the pTpB vector.

Entry vector construction 

pDONR 223 attP2r-attP4 was a kind gift of Kenneth Scott
(Baylor College of Medicine). To engineer the pDONR
attP4r/P5 entry vector, the SalI fragment was excised from
pDONR 223 attP2r-attP4, leaving only the attP4 site in
the vector backbone. Separately, the EcoRV/SalI fragment
containing the CmR/ccdB cassette was subcloned into a
synthetic plasmid containing an XbaI-EcoRV-XhoI-attP5-NheI
insert in the pIDTSMART vector (IDT DNA Technologies,
Coralville, IA, USA). Finally, the XbaI/NheI fragment
was subcloned from the modified pIDTSMART vector
into the initial pDONR223 construct from which the
SalI fragment had been excised.

Construction of donor vectors 

The Turbo-RFP donor plasmid was generated by polymerase
chain reaction (PCR) amplification of the coding
sequence of the turbo-red fluorescent protein (tRFP)
from its commercially available vector (Evrogen, Farmingdale,
NY, USA) using primers containing the attB1 and
attB2 Gateway® recombination sequences. Donor plasmids
with siRNA/miRNA sensor elements or various 3'
UTRs were generated by PCR amplification of synthetic
oligonucleotides or UTR elements using primers containing
the Gateway® attB2r and attB4 recombination sequences.
The minimal polyadenylation/barcode element donor plasmids
were generated by amplification of an oligonucleotide
(attR4_mPA_barcode_attL5 oligo) using primers containing
Gateway® attB4r and attB5 recombination sequences. Barcodes
within the attR4_mPA_barcode_attL5 oligonucleotide
were generated using a sequence of mixed bases corresponding
to the nucleotide frequency of bases following endogenous
polyadenylation sites in the human genome. Following
BP recombination and transformation in
TOP10 competent cells, entry clones were screened through
colony PCR using M13 forward and reverse primers and
all putative positive clones were sequence verified. Relevant
oligonucleotide sequences are listed in Supplementary Table
S1.
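
As an illustration of the mixed-base barcode design described above, the following minimal Python sketch draws 24-nt barcodes from a table of base frequencies. The frequency values, the function names and the use of a single position-independent table are assumptions made for illustration only; the actual frequencies were derived from sequence following endogenous polyadenylation sites and are not reproduced here.

```python
import random

# Hypothetical base frequencies (illustrative values only, not the
# frequencies used to design the pBUTR barcodes).
EXAMPLE_FREQS = {"A": 0.25, "C": 0.20, "G": 0.20, "T": 0.35}


def make_barcode(length=24, freqs=EXAMPLE_FREQS, rng=random):
    """Draw one barcode, sampling every position from the same base frequencies."""
    bases = list(freqs)
    weights = [freqs[b] for b in bases]
    return "".join(rng.choices(bases, weights=weights, k=length))


def make_barcode_pool(n, length=24, freqs=EXAMPLE_FREQS, seed=0):
    """Draw n distinct barcodes; duplicates are discarded and redrawn."""
    rng = random.Random(seed)
    pool = set()
    while len(pool) < n:
        pool.add(make_barcode(length, freqs, rng))
    return sorted(pool)


if __name__ == "__main__":
    for barcode in make_barcode_pool(5):
        print(barcode)
```

In practice, mixed-base synthesis produces the pool chemically, so a simulation of this kind would mainly be useful for estimating how often two clones are expected to share a barcode.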

Construction of expression reporters 

Complete expression reporters were generated via four-part
recombineering using the destination vector with equimo- 
lar amounts of the three donor plasmids — the tRFP en- 
try clone, the donor plasmid containing the 3' UTR of 
interest and the pool of donor plasmids containing the 
minimal polyadenylation signal (35) and barcode — using 
Gateway LR Clonase II enzyme mix (Life Technologies, 
Carlsbad, CA, USA). Recombination reactions were transformed
into One Shot Mach1 competent cells (Life Technologies,
Carlsbad, CA, USA), which were plated on LB-
Agar containing both ampicillin and kanamycin. All prop- 
erly recombined expression vectors were initially identified 
via colony PCR and subsequently sequence verified. 

Cell culture and treatment 

Hela, MCF7, MCF10A and U937 cell lines were ob- 
tained from ATCC (Manassas, VA, USA). MCF10A cells 
were cultured in Dulbecco's modified Eagle's medium 
(DMEM)/F12 medium (Life Technologies, Carlsbad, CA, 
USA), containing 5% horse serum (Life Technologies, 
Carlsbad, CA, USA), 0.01-mg/ml bovine insulin (Cell Applications,
San Diego, CA, USA), 0.5-µg/ml hydrocortisone
(Sigma-Aldrich, St. Louis, MO, USA), 100-ng/ml
cholera toxin (Sigma-Aldrich, St. Louis, MO, USA), 20- 
ng/ml human EGF (Peprotech, Rocky Hill, NJ, USA) 
and 100-U/ml penicillin and 0.1-mg/ml streptomycin (Life 
Technologies, Carlsbad, CA, USA). MCF7 and Hela cells 
were cultured in DMEM medium, containing 10% fetal 
bovine serum (FBS) (Lonza, Walkersville, MD, USA) and 
100-U/ml penicillin and 0.1-mg/ml streptomycin. U937 
cells were cultured in RPMI1640 (ATCC, Manassas, VA, 
USA), containing 10% FBS and 100-U/ml penicillin and 
0.1-mg/ml streptomycin. Cells were kept at 37°C under
a humidified atmosphere of 5% CO2. Where indicated,
MCF7 and MCF10A cells were treated with TGFβ1
(R&D Systems, Minneapolis, MN, USA) at a final concentration
of 5 ng/ml for 72 h. U937 cells were treated
with lipopolysaccharide (LPS) from Escherichia coli O26:B6
(Sigma-Aldrich, St. Louis, MO, USA) at a final concentration
of 1 µg/ml for 24 h.

Transfection and stable clone generation 

Hela cells (2 × 10^5) were transfected using Lipofectamine
2000 (Life Technologies, Carlsbad, CA, USA). MCF7 (4 ×
10^4), MCF10A (4 × 10^4) and U937 (10^5) cells were transfected
using Lipofectamine LTX (Life Technologies, Carlsbad,
CA, USA), each according to the manufacturer's instructions.
Plasmids containing transposase (pCMV-HA-m7pB)
and transposon (respective pBUTR vector) were
used at a ratio of 1:2. Forty-eight hours after transfection,
cells were split 1:10 and subsequently selected with
G418 (1000 µg/ml for MCF7, MCF10A and Hela and 250
µg/ml for U937 cells) (Teknova, Hollister, CA, USA) for
~2 weeks. Indicated miRNA mimics and inhibitors (Life
Technologies, Carlsbad, CA, USA) were used at 30 nM,
whereas ZFP36 siRNA (Life Technologies, Carlsbad, CA,
USA) was used at 300 nM. Transiently transfected cells were
harvested and analyzed at 24 h post transfection.

Flow cytometry and cell sorting 

Expression of tGFP, tRFP, E-Cadherin and CD86 was determined
by flow cytometry using a FACSCalibur system
(BD Biosciences). For assessing E-Cadherin expression,
MCF7 and MCF10A cells were scraped in calcium and
magnesium free phosphate buffered saline (PBS) (Life Technologies,
Carlsbad, CA, USA) containing 1-mM ethylenediaminetetraacetic
acid (EDTA), pelleted at 129 g for 3
min and stained with allophycocyanin (APC) CD324 (E-Cadherin)
antibody (clone 67A4) (BioLegend, San Diego,
CA, USA). For CD86 protein expression, U937 cells were
pelleted at 290 g for 5 min, then stained with APC CD86 antibody
(BioLegend, San Diego, CA, USA). Hela cells were
harvested by trypsinization. All cells were resuspended in
PBS containing 2% FBS and 1-mM EDTA and counterstained
with 10-µg/ml propidium iodide (PI) (Roche Diagnostics,
Indianapolis, IN, USA) to allow exclusion of dead
cells. At least 30 000 events were collected for each analysis.
Data were analyzed using FlowJo version 9 (Tree Star,
Ashland, OR, USA).

To obtain reference points for setting the flow cytometry 
gates for cell sorting, single dye positive and negative con- 
trol samples were prepared for each of the fluorescent sig- 
nals (tGFP, tRFP, E-cadherin and PI) used. The top 10% 
of tRFP positive population in indicated MCF10A cells 
(which were all GFP positive) were sorted by a BD FACSAria
II cell sorter (BD Biosciences) using a 100-µm filter and
20-psi nozzle pressure. The cells were collected in FBS and 
immediately processed for genomic deoxyribonucleic acid 
(DNA) isolation. 

Genomic DNA isolation, library preparation and limited next 
generation sequencing 

Genomic DNA was isolated from indicated populations of 
MCF10A cells using overnight lysis (100-mM NaCl, 20-mM
Tris, pH 7.6, 10-mM EDTA, pH 8.0, 0.5% sodium
dodecyl sulphate and 0.5-mg/ml proteinase K) at 55°C
before salting out with 60% volume saturated NaCl and
ethanol precipitation. Barcoded PCR primer pairs (Supplementary
Table S1) were designed and used to amplify the
3'-UTR-intrinsic unique barcode elements in the indicated
populations. An Ion Torrent adapter-ligated library was 
made following the manufacturer's Ion Plus Fragment Li- 
brary Kit (Life Technologies, Carlsbad, CA, USA) protocol 
(#4471252, Revision 3.0). The resulting libraries were pu- 
rified using AMPure beads (Agencourt, Beckman Coulter, 
Brea, CA, USA) and the concentration was determined using
the Quant-iT PicoGreen dsDNA Assay Kit (Life Technologies,
Carlsbad, CA, USA). Sample emulsion PCR, emulsion
breaking and enrichment were performed using the Ion
PGM Template OT2 200 Kit (#4480974, Revision 5.0) fol- 
lowing manufacturer's instructions. The samples were pre- 
pared for sequencing using the Ion PGM 200 Sequencing 
Kit (#4474004, Revision C) and the complete samples were 
loaded on an Ion 314 chip and sequenced on the PGM. 
Data from the PGM runs were processed initially using
bam2fastq to generate the fastq files, and custom Perl scripts
were used to trim adapter sequences, filter reads and determine
the percent representation of the different barcodes in the 
indicated populations. 
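
The barcode quantification step can be sketched as follows. The original analysis used custom Perl scripts that are not shown here; this Python sketch, with hypothetical function and variable names, only illustrates the idea of counting reads by exact barcode match and converting the counts to percent representation. It assumes adapter trimming and quality filtering have already been done.

```python
from collections import Counter


def barcode_percentages(reads, barcodes):
    """Return the percent representation of each known barcode.

    reads    -- iterable of read sequences (already trimmed and filtered)
    barcodes -- list of known barcode sequences
    A read is assigned to the first listed barcode it contains; reads
    matching no barcode are ignored.
    """
    counts = Counter()
    for read in reads:
        for barcode in barcodes:
            if barcode in read:
                counts[barcode] += 1
                break
    total = sum(counts.values())
    return {
        barcode: (100.0 * counts[barcode] / total if total else 0.0)
        for barcode in barcodes
    }


if __name__ == "__main__":
    # Toy reads and 4-nt barcodes, for illustration only.
    example_reads = ["ttAAAAtt", "AAAA", "CCCCgg", "GGGG"]
    print(barcode_percentages(example_reads, ["AAAA", "CCCC"]))
```

Comparing these percentages between the sorted and unsorted populations would then indicate which barcoded reporters are enriched in the sorted fraction.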

Preparation of whole cell lysates and immunoblot analysis 

Cells were lysed in buffer containing 25-mM Tris-HCl
pH 7.4, 150-mM NaCl, 1-mM EDTA, 1% NP-40 and 
5% glycerol containing complete, Mini protease inhibitor 
cocktail (Roche Diagnostics, Indianapolis, IN, USA). Ten 
micrograms of whole cell lysate was resolved on a Nu- 
PAGE 4-20% gel (Life Technologies, Carlsbad, CA, USA), 
transferred to an Immobilon PVDF membrane (Millipore, 
Billerica, MA, USA) and probed with E-cadherin anti- 
body (clone 24E10) (Cell Signaling Technology, Danvers, 
MA, USA) and N-cadherin antibody (Cell Signaling Tech- 
nology, Danvers, MA, USA). The blot was subsequently 
stripped and re-probed for Hsp90 (clone C45G5) (Cell 
Signaling, Danvers, MA, USA) to confirm equal loading. 
The blots were imaged using enhanced chemiluminescence 
(ECL) Plus western blotting substrate (Pierce, Rockford, 
IL, USA) and HyBlot chemiluminescence (CL) autoradio- 
graphy film (Denville Scientific, Metuchen, NJ, USA). 

Generation and injection of pBUTR-ZEB1-expressing embryonic
stem cells (ESCs)

All experiments were performed in accordance with the 
guidelines of the Baylor College of Medicine Institutional 
Animal Care and Use Committee. V6.5 ESCs (36) derived
from F1 hybrid strain (C57BL/6 × 129/Sv) were a
kind gift from Dr Rudolf Jaenisch. ESCs were cultured 
at 37°C in 5% CO2 in complete ES medium composed 
of DMEM (Millipore, Billerica, MA, USA), 15% FBS 
(Hyclone, Rochester, NY, USA), 1000-U/ml leukemia in- 
hibitory factor (LIF) (ESGRO, Millipore, Billerica, MA, 
USA), 1% β-mercaptoethanol (Millipore, Billerica, MA,
USA), 1% non-essential amino acids (Millipore, Billerica, 
MA, USA), 1% L-glutamine (Life Technologies, Carlsbad, 
CA, USA), 0.5% penicillin/streptomycin (Life Technolo- 
gies, Carlsbad, CA). 



V6.5 ESCs (5 × 10^6 cells) in suspension in 1X PBS
were nucleofected using a Bio-Rad GenePulser (Hercules,
CA, USA) with 1 µg each of pBUTR-ZEB1 wild-type
(wt) vector and pCMV-HA-m7pB transposase (38). Post-nucleofection,
cells were plated onto 100-mm culture dishes
with a feeder layer. After selection with G418 (300 µg/ml)
for 8 days, resulting ESC colonies were verified for tGFP
and tRFP expression, expanded and frozen. pBUTR-ZEB1
wt ES cell clones were injected into 2-N (3.5-days post 
coitus) C57BL/6 blastocysts and subsequently transferred 
to the uterine horns of 2.5-days post coitus pseudopregnant 
ICR recipient female mice.

Embryo harvest, tissue preparation and imaging 

Pregnant females were sacrificed by carbon dioxide asphyxiation
at day 11.5 postcoitus (e11.5). Embryos were dissected
and fixed in 4% paraformaldehyde for 1 h, cryoprotected in
15% and 30% sucrose in PBS and embedded in optimal cutting
temperature (OCT) compound (Sakura Finetek, Torrance,
CA, USA) before cryosectioning at 10 µm. The sections
were mounted on SuperFrost Plus slides (Thermo
Fisher Scientific, Houston, TX, USA) using Vectashield 
(Vector Labs, Burlingame, CA, USA). 

Images were obtained using a Zeiss LSM 510 META confocal
laser scanning microscope (Carl Zeiss Microimaging,
Thornwood, NY, USA). Each tissue section was initially
centered manually using a Carl Zeiss EC Plan-Neofluar
10x/0.3 objective. Sections were tile scanned using a 5 × 8
grid pattern (at 898.24 µm^2/grid) allowing for a resolution
of 512 × 512 pixels per field at a depth of 28.29 µm.

RESULTS 

Vector design and construction 

We considered that an ideal vector for monitoring post- 
transcriptional regulation at the mRNA level would con- 
tain three independent transcription units, including a se- 
lection marker, a control expression cassette and the experi- 
mental expression cassette. Further considerations included 
a desire for ubiquitously expressed cellular (rather than vi- 
ral) promoter elements to reduce the risk of saturation of 
the post-transcriptional regulatory machinery, and the in- 
clusion of chimeric introns in each of the two reporter ele- 
ments to ensure reasonable expression levels in the absence 
of non-endogenous and potentially confounding RNA sta- 
bility elements. We thus modified the previously described 
pTpB vector [(34) — a kind gift from Dr Matthew Wilson, 
Baylor College of Medicine] to conform to this architec- 
ture (Figure 1). Briefly, the promoter and coding sequence 
of the pre-existing Neo/G418 selection cassette in this vec- 
tor were retained and terminated by introducing a Bgh 
polyadenylation signal. 3' to this selection cassette, we in- 
serted an Ubiquitin C (UbC) promoter element upstream of 
a modified Gateway® selection cassette in which the standard
chloramphenicol resistance marker (CmR) and ccdB
bacterial suicide gene were flanked by attR1 and attR5 recombination
sites. Finally, a tGFP cassette driven by the
murine phosphoglycerate kinase 1 (Pgk) promoter was inserted
downstream of the former two elements such that it
was terminated by the SV40 late polyadenylation signal present
in the original parent vector.

Figure 1. Schematic representation of the pBUTR vector backbone. attXN, Gateway® recombination site; tRFP, turboRFP; UTR, untranslated region;
mPA, minimum polyadenylation signal (35); BC, 24-nt barcode; PAS, polyadenylation signal; SV40 (left), SV40 early promoter region; Neo, neomycin-resistance
gene; Bgh, bovine growth hormone polyadenylation signal; UbC, Ubiquitin C promoter element; CmR, chloramphenicol-resistance gene; PGK,
murine phosphoglycerate kinase 1 promoter; tGFP, turboGFP; SV40 (right), SV40 late polyadenylation signal. Features not to scale.

The unconventional configuration of the attR sites in the 
completed pBUTR (piggyBac 3' UnTranslated Region reporter)
destination vector was designed to take advantage
of existing local open reading frame (ORF) entry clone libraries
in an attL1/L2 format. The pBUTR destination
vector is functionalized by four-part Gateway® recombineering
using an attL1/L2-flanked coding sequence
of interest, an attR2/attL4-flanked 3' UTR element and
an attR4/attL5-flanked minimal polyadenylation sequence
(35) followed by a unique 24-nucleotide barcode. The com- 
position of the barcode, generated via mixed nucleotide syn- 
thesis, was informed by the average nucleotide composi- 
tion of the 24 base pairs following the G/U-rich region 
of native polyadenylation sequences in the human genome. 
The inclusion of unique barcode elements with the minimal 
polyadenylation signal was made to allow analyses within 
pooled cell populations via flow cytometry and cell sorting. 

Validation of pBUTR functionality using synthetic miRNA 
sensors and response to RNAi-mediated targeting 

We first assessed the performance of the pBUTR vector as 
a siRNA/miRNA sensor, recombineering tRFP under the 
control of three distinct 3' UTR elements. Each element 
contained a tandem duplicate of synthetic sequence per- 
fectly complementary to the broadly used synthetic CXCR4 
siRNA (37), the mature human miR-17 miRNA or the ma- 
ture human miR-124 miRNA. Each of the constructs, along 
with the pCMV-HA-m7pB transposase (38,39), was individ- 
ually transfected into the Hela cell line. Stable transfectants 
were isolated by G418 selection and tGFP expression was 
verified by flow cytometry. Hela cells are known to express 
miR-17 (40); however, miR-124 expression is largely lim- 
ited to neuronal lineages (41). As expected, tRFP expression 
(monitored by flow cytometry) in the Hela cells stably transfected
with the miR-17 sensor (mock treatment, Figure 2C)
was significantly reduced as compared to Hela cells stably 
transfected with either the sensor to the synthetic CXCR4 
siRNA (mock treatment, Figure 2A) or the miR-124 sensor 
(mock treatment, Figure 2B). Transient transfection of the
cells carrying the CXCR4 sensor with the CXCR4 siRNA,
or cells carrying the miR-124 sensor with a synthetic miR-124
mimic, resulted in a marked decrease in tRFP expression
(Figure 2A and B). In contrast, transient transfection
of the cells carrying the miR-17 sensor with an miR-17 'antagomir'
(42) resulted in a robust increase in tRFP expression
(Figure 2C). No change in tGFP protein expression
was observed in any of the examined conditions (Figure 2). 
Taken together, these experiments confirm the functionality
of the pBUTR vector system, including the G418R selection
marker, the tGFP control expression cassette and the
recombineered tRFP experimental expression cassette. The 
results of the experiments suggest that the pBUTR vector is 
a valid reagent for in situ monitoring of miRNA or siRNA 
activity via synthetic, perfectly complementary target sites. 

Monitoring endogenous 3' UTR-mediated post- 
transcriptional regulation by miRNAs using the pBUTR 
system 

Given the validation of the functionality of our bi- 
fluorescent pBUTR reporter system in the context of synthetic
miRNA/siRNA sensor elements, we next set out to
assess the performance of the reporter system in monitor- 
ing well-characterized models of post-transcriptional regu- 
lation by miRNAs and RNA-binding proteins. 

We first assessed miRNA-mediated repression in a 
cell-based model of epithelial to mesenchymal transition 
(EMT). The E-cadherin transcriptional repressors ZEB1 
(also known as δEF1) and ZEB2 (also known as SIP1) play
established roles in EMT and tumor metastasis (43). The 
mRNA transcripts of both of these gene products are char- 
acterized by multiple, validated miR-200 family recognition 
elements in their respective 3' UTRs (43). Cells with an ep- 
ithelial phenotype express high relative levels of the miR- 
200b miRNA, which enforces post-transcriptional repres- 
sion of the ZEB1 and ZEB2 mRNA transcripts. However, 
as cells undergo EMT, for example in response to transforming
growth factor-beta (TGF-β), relative levels of miR-200b
are reduced, allowing increased expression of ZEB1
and ZEB2 proteins and transcriptional repression of the 
CDH1 (E-cadherin) gene. 
Figure 2. Use of the pBUTR system as a siRNA or miRNA sensor. Flow cytometric analysis of Hela cells stably transfected with pBUTR recombineered
to contain tRFP fused to synthetic 3' UTR elements in which two tandem sequences perfectly complementary to the indicated siRNA/miRNA are present.
(A) Response of Hela cells carrying a CXCR4 sensor to transiently transfected CXCR4 siRNA (blue) or mock transfection (red). (B) Response of Hela cells
carrying an miR-124 sensor to transiently transfected miR-124 mimic (blue) or mock transfection (red). (C) Response of Hela cells carrying a tRFP miR-17
sensor to transiently transfected anti-miR-17 antagomir (blue) or mock transfection (red). All data are representative of a minimum of three individual
experiments performed 24 h post-transient transfection on G418-selected cells. tRFP, turboRFP; UTR, untranslated region; Antag, antagomir. 



Previously described (41) wt and mutant (where each 
miR-200b-binding site has been ablated via site-directed
mutagenesis) ZEB1 and ZEB2 3' UTR elements were re- 
combineered into the pBUTR destination vector so as to 
confer regulation upon tRFP expression in the assembled 
reporter. Both the human non-transformed mammary ep- 
ithelial cell line MCF10A and the human breast adenocar- 
cinoma cell line MCF7 were stably transfected with each of 
the four resulting pBUTR reporters. Following G418 selection,
cells were treated with TGF-β or vehicle for 72 h. As
expected, MCF10A cells switched from polarized, tightly
packed discoid epithelial cells to a highly motile fibroblastic
or mesenchymal phenotype, characteristic of the distinct
morphological changes associated with EMT (44,45), while
MCF7 cells, which are refractory to TGF-β-mediated EMT
(44,46), maintained epithelial morphology (Figure 3A).
MCF10A-specific EMT was further verified in the stably
transfected cell lines via immunoblot, which demonstrated
a reduction in E-cadherin protein expression concomitant
with an induction of the mesenchymal cell marker
N-cadherin in TGF-β-treated MCF10A, but not MCF7, cells
(Figure 3B).

We next employed multicolor flow cytometry to assess the 
expression of the wt and mutant ZEB reporters in each of 
the stably transfected cell lines. As expected, treatment of 
MCF10A cells with TGF-β resulted in decreased surface
levels of E-cadherin protein. In cells stably transfected with
the wt ZEB1 and ZEB2 reporters, these decreased levels coincided
with marked increases in tRFP fluorescence. The
levels of tRFP fluorescence in these cells, as assessed via median
fluorescence intensity, were similar to those observed
in MCF10A cells transfected with mutant ZEB1 and ZEB2
reporters. In the latter cells, although decreased surface levels
of E-cadherin were observed upon TGF-β treatment,
tRFP levels remained constant (Figure 3C). TurboGFP fluorescence
levels were essentially unchanged in all experimental
conditions (Supplementary Figure S1A).
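
One way to summarize a two-reporter readout of this kind is sketched below in Python. The text reports median fluorescence intensities; dividing the tRFP median by the tGFP median and expressing the result as a treated-over-mock fold change is an assumption made here for illustration, not a description of the analysis actually performed, and the function names and toy values are hypothetical.

```python
from statistics import median


def normalized_reporter(trfp, tgfp):
    """Median tRFP intensity divided by median tGFP intensity for one sample."""
    return median(trfp) / median(tgfp)


def fold_change(treated, control):
    """Normalized reporter signal of the treated sample relative to the control.

    Each argument is a (trfp_values, tgfp_values) pair of per-cell intensities.
    """
    return normalized_reporter(*treated) / normalized_reporter(*control)


if __name__ == "__main__":
    # Toy per-cell intensities, for illustration only.
    mock = ([10, 20, 30], [100, 100, 100])
    tgf_beta = ([40, 60, 80], [100, 100, 100])
    print(fold_change(tgf_beta, mock))
```

Normalizing to the control reporter in this way would correct for differences in copy number or integration site between samples, provided the control cassette itself is unaffected by the treatment.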

In contrast to the stably transfected MCF10A cells, surface
E-cadherin levels in TGF-β-treated MCF7 cells were
effectively identical when compared to mock treated controls
(Figure 3D). tRFP fluorescence was significantly
elevated in MCF7 cells carrying the mutant, as
compared to the wt, ZEB1 and ZEB2 reporters. Once again,
levels of tGFP fluorescence derived from the control reporter
cassette were largely consistent (Supplementary Figure
S1B).

Taken together, these experiments are consistent with 
well-characterized models (43) in which the miR-200 fam- 
ily binding sites in the ZEB1 and ZEB2 mRNA transcripts 
mediate negative post-transcriptional regulation of these 
transcripts by miR-200b in the epithelial, but not the mes- 
enchymal state. Nonetheless, to further support this conclu- 
sion we examined the response of the reporters to transient 
transfection of anti-miR-200b antagomirs and miR-200b 
mimics in mock treated MCF7 and MCF10A and TGF-β-treated
MCF10A cells. Anti-miR-200b transfection resulted
in increased relative tRFP fluorescence derived from wt, but 
not mutant, ZEB1 and ZEB2 reporters in both untreated 
MCF7 and MCF10A (Figure 4A and B and Supplemen- 
tary Figure S2A and B). In contrast, transient transfection 
of miR-200b mimic into TGF-β-treated MCF10A cells
resulted in decreased relative tRFP fluorescence derived from
wt, but not mutant, ZEB1 and ZEB2 reporters (Figure 
4C and Supplementary Figure S2C), confirming that miR- 
200b activity was both necessary and sufficient for post- 
transcriptional regulation of the ZEB1 and ZEB2 reporters 
in this model of EMT. 
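
The "relative tRFP fluorescence" compared in these experiments can be illustrated with a short calculation. The Python sketch below is illustrative only and is not the analysis pipeline used in this study. It assumes that per-cell tRFP and tGFP intensities have already been exported from the cytometer as plain lists; the function names and the example intensities are hypothetical.

```python
from statistics import median


def relative_trfp(trfp, tgfp):
    """Median tRFP intensity normalized to the median tGFP control intensity."""
    control = median(tgfp)
    if control <= 0:
        raise ValueError("median tGFP intensity must be positive")
    return median(trfp) / control


def fold_change(treated, untreated):
    """Ratio of relative tRFP between a treated and an untreated sample.

    Each sample is a (tRFP intensities, tGFP intensities) pair.
    """
    return relative_trfp(*treated) / relative_trfp(*untreated)


# Hypothetical per-cell intensities (arbitrary units), not data from this study.
mock = ([100.0, 120.0, 110.0], [500.0, 520.0, 510.0])
anti_mir = ([210.0, 230.0, 220.0], [495.0, 515.0, 505.0])

print(round(fold_change(anti_mir, mock), 2))
```

Under these assumptions, a fold change above 1 for a wt reporter together with a value near 1 for the matched mutant reporter would correspond to the de-repression pattern described above.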



Performance of pBUTR as a reporter of 3' UTR-mediated 
post-transcriptional regulation by RNA-binding proteins 

Given the performance of the pBUTR reporter in monitoring
defined miRNA targets in a physiologically relevant setting,
we next assessed whether it performed similarly for
regulation by RNA-binding proteins. One of the
best-characterized examples of this type of regulation is
observed within myeloid cells of the immune system, where
tumor necrosis factor alpha (TNF-α) mRNA stability is
reduced in response to signaling by Toll-like receptor 4
(TLR4) (47). While stimulation of TLR4 by LPS stimulates
transcription of the TNF-α mRNA, levels of the
tristetraprolin (also known as ZFP36) protein are increased
at the same time; tristetraprolin destabilizes the TNF-α
mRNA transcript via recognition and binding of an
adenosine/uracil-rich element (ARE) located in the 3' UTR
of the transcript (47).

Nucleic Acids Research, 2014, Vol. 42, No. 10 e86

[Figure 3, panels (C) and (D): flow cytometry plots of MCF10A and MCF7 cells carrying the wt-ZEB1, wt-ZEB2, mu-ZEB1 and mu-ZEB2 reporters; x-axis, tRFP; legend, Mock and TGF-β (72 h).]

Figure 3. Monitoring endogenous miRNA-mediated regulation in vitro. (A) Phase contrast micrographs of untreated or TGF-β-treated MCF10A (top
panels) and MCF7 (bottom panels) cells. Images were obtained at ×10 magnification. (B) Immunoblot (IB) analysis of E-cadherin (upper panel) and
N-cadherin (middle panel) protein levels in MCF7 and MCF10A cells treated with TGF-β for 72 h. The blot was stripped and re-probed for Hsp90 (bottom
panel) as a loading control. (C) and (D) Flow cytometric analysis of mock-treated (red) and TGF-β-treated (blue) MCF10A (C) and MCF7 (D) cells
transfected with vector recombineered to contain tRFP under the control of the wild-type (wt) ZEB1 (top panels) or ZEB2 (bottom panels) 3'-UTR,
which is responsive to miR-200, or a mutant (mu) 3'-UTR in which the miR-200 recognition elements are deleted (43). All data are representative of a
minimum of three individual experiments. UTR: untranslated region.

We recombineered tRFP into the pBUTR vector under the
control of wt (wt-TNF-α) or mutant (Δ-TNF-α, in which the
ARE was deleted via site-directed mutagenesis) TNF-α
3' UTR elements. These vectors were used to transfect the
human monocytic macrophage cell line U937, and stable
transfectants were obtained via G418 selection. Stably
transfected cells were mock-treated or treated with LPS for
24 h and then analyzed by flow cytometry using CD86
protein induction as a marker for activation (Figure 5A).
Unstimulated U937 cells stably transfected with the
wt-TNF-α (Figure 5B) and Δ-TNF-α (Figure 5C) reporters
were characterized by robust tRFP fluorescence. As
expected, tRFP fluorescence was markedly decreased in
LPS-treated U937 cells stably transfected with the
wt-TNF-α reporter (Figure 5B). In contrast, no reduction in
tRFP fluorescence was observed in LPS-treated U937 cells
stably transfected with the Δ-TNF-α reporter (Figure 5C).
Once again, tGFP fluorescence was essentially identical in
each case (Figure 5B and C).

Figure 4. Differential regulation of tRFP expression in the pBUTR ZEB1 and ZEB2 3' UTR reporters was mediated by miR-200b expression levels
associated with EMT. (A) and (B) Flow cytometric analysis 24 h post-transient transfection of anti-miR-200b antagomir (blue) in untreated MCF7 (A)
and MCF10A (B) cells transfected with vector recombineered to contain tRFP under the control of the wild-type (wt) ZEB1 (top panels) or ZEB2
(bottom panels) 3' UTR. (C) Flow cytometric analysis 24 h post-transient transfection of miR-200b mimic (blue) in TGF-β-treated (72 h) MCF10A
cells transfected with vector recombineered to contain tRFP under the control of the wild-type (wt) ZEB1 (top panels) or ZEB2 (bottom panels) 3' UTR.
All data are representative of a minimum of three individual experiments. tRFP, turboRFP; UTR, untranslated region.

To further confirm that the decrease in wt-TNF-α reporter
stability, and hence tRFP fluorescence, was due to ZFP36
protein activity, U937 cells harboring the wt or mutant
TNF-α 3' UTR elements were transiently transfected with
siRNAs targeting ZFP36 before being treated with LPS.
Silencing of ZFP36 gene expression (Figure 5D) completely
prevented the reduction in tRFP fluorescence in LPS-treated
U937 cells stably transfected with the wt-TNF-α reporter
(Figure 5E). As expected, no effect on tRFP fluorescence
was observed when ZFP36 expression was silenced in
LPS-treated U937 cells stably transfected with the Δ-TNF-α
reporter (Figure 5F). Taken together, our experiments are
consistent with the notion that the pBUTR vector/reporter
system is well suited for monitoring known or novel
instances of 3' UTR-mediated post-transcriptional
regulation, whether by miRNAs or RNA-binding proteins,
in a broad spectrum of in vitro models of cellular physiology.



Performance of pBUTR in a pooled screening approach to 
monitor endogenous 3' UTR-mediated post-transcriptional 
regulation by miRNAs 

The pBUTR vector was functionalized with Gateway®
technology to allow high-dimensionality screening and
validation applications. Because Gateway® recombineering
is scalable, meaning multiple individual 3' UTR elements
can be cloned into the vector in bulk, an inclusive,
aggregate set of 3' UTRs of interest can be rapidly generated
to determine whether each or any of these 3' UTRs confers
any of the observed translational regulation in a given
physiological context. As a proof of concept, we elected to
assess 3' UTR-mediated responsiveness to TGF-β treatment
and EMT in MCF10A cells for 11 distinct genes (TAPBPL,
HIST1H4K, CCNL1, PLK4, PDZK1IP1, WBP4, SYS1,
RNF6, SUV39H2, ZEB1 and ZEB2). The 11 reporter
constructs were built in bulk, sequence verified and
individually stably transfected into MCF10A cells, which
were subsequently selected with G418.

Figure 5. pBUTR system can be used to study preferential regulation of mRNA stability by cis-elements. (A) Flow cytometric analysis of U937 cells
transfected with vector recombineered to contain tRFP under the control of the wild-type (wt) TNF-α 3'-UTR for CD86 protein expression (positive
activation marker) post-LPS stimulation for 24 h. (B) and (C) Flow cytometric analysis of untreated (red) or 24-h LPS-treated (blue) U937 cells transfected
with vector recombineered to contain tRFP under the control of the wild-type (wt) 3'-UTR (B) or mutant (mu) TNF-α 3'-UTR in which the AU-rich
elements are deleted (C). (D) Immunoblot (IB) analysis of ZFP36 (upper panel) protein levels in U937 cells harboring wt or mutant TNF-α 3'-UTR
reporters transiently transfected with siRNA-ZFP36 post-LPS stimulation for 24 h. The blot was stripped and re-probed for Hsp90 (bottom panel) as a
loading control. (E) and (F) Flow cytometric analysis of untreated (red) or 24-h LPS-treated (blue) U937 cells harboring wt (E) or mutant (F) TNF-α
3'-UTR reporters transiently transfected with siRNA-ZFP36 vector. All data are representative of a minimum of three individual experiments. tRFP,
turboRFP; UTR, untranslated region; LPS, lipopolysaccharide.



We first assessed 3' UTR-mediated responsiveness to
TGF-β signaling and EMT in an arrayed format. Each
stably transfected line was treated with TGF-β or vehicle
for 72 h. EMT was monitored both morphologically and by
loss of E-cadherin expression via flow cytometry. As
expected, cells stably transfected with ZEB1 and ZEB2
3' UTR reporter constructs were marked by an increase in
tRFP expression upon TGF-β treatment. In contrast, no
increase in relative fluorescence was observed for the
reporters of the other nine genes (Figure 6A).

[Figure 6 panels. (A) Flow cytometry histograms (x-axis, tRFP; legend, Mock and TGF-β) for the indicated 3' UTR reporters, including SYS1, PLK4, HIST1H4K, SUV39H2, RNF6, PDZK1IP1 and wt-ZEB2. (B) Workflow: transfect 11 pBUTRs (inclusive of wt-ZEB1 and wt-ZEB2) into MCF10A cells; G418 selection; selected GFP+ cells; (-) and (+) TGF-β pools, each yielding a total population and the top 10% tRFP^hi cells. (C) Bar chart (scale to 100) of Other, ZEB1 3'-UTR and ZEB2 3'-UTR barcodes in the total and tRFP^hi populations, (-) and (+) TGF-β.]

Figure 6. pBUTR system is scalable and is amenable to arrayed and pooled screens to study endogenous post-transcriptional gene regulation. (A) 3' UTR-
mediated responsiveness to TGF-β signaling and EMT in an arrayed format. Flow cytometric analysis of mock-treated (red) and TGF-β-treated (blue)
MCF10A cells transfected with vector recombineered to contain tRFP under the control of the indicated 3'-UTRs. (B) Schematic representation of the
experimental design to test the pBUTR system in a pooled screen. (C) Comparison of relative pBUTR barcode sequence abundance from each of the four
populations [untreated, untreated tRFP^hi, treated (TGF-β for 72 h) and treated tRFP^hi (TGF-β for 72 h) MCF10A cells] via limited Next Generation
Sequencing. tRFP, turboRFP; UTR, untranslated region.

We next tested our ability to replicate these results in the
context of a pooled screening approach. MCF10A cells
stably transfected with each of the 11 individual reporters
were mixed together such that each reporter was
equivalently represented within the population. This
mixture of cells was divided into two pools, which were
treated with TGF-β or vehicle, respectively, for 72 h. Once
again, EMT was monitored both morphologically and by
loss of E-cadherin expression via flow cytometry. Ten
percent of each pool of cells was then collected for genomic
DNA isolation. The remaining 90% of each population was
sorted via flow cytometry to obtain the subpopulation (10%)
of cells characterized by the highest tRFP expression.
Genomic DNA was isolated from these tRFP^hi
populations. We next amplified the pBUTR barcode
sequences from each of the four populations (untreated,
untreated tRFP^hi, treated and treated tRFP^hi) and assessed
the relative abundance of the barcodes in each of these four
populations via limited Next Generation Sequencing
(Figure 6B).
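
The "top 10%" sort gate used in this pooled design can be expressed as a simple rank-based selection. The Python sketch below is only a conceptual illustration: in practice the gate is drawn on the cell sorter, and the function name and the synthetic intensity values are hypothetical rather than taken from this study.

```python
def top_fraction_gate(intensities, fraction=0.10):
    """Return (threshold, selected) for the brightest `fraction` of cells.

    `threshold` is the lowest intensity that still falls inside the gate.
    Ties at the threshold are not treated specially in this sketch.
    """
    if not intensities:
        raise ValueError("at least one intensity is required")
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    ranked = sorted(intensities, reverse=True)
    n_keep = max(1, round(len(ranked) * fraction))
    selected = ranked[:n_keep]
    return selected[-1], selected


# Synthetic tRFP intensities for 100 cells (arbitrary units 1..100).
cells = list(range(1, 101))
threshold, bright = top_fraction_gate(cells)
print(threshold, len(bright))
```

Applying the same gate fraction to the vehicle and TGF-β pools keeps the two sorted populations comparable in size, so that differences in barcode composition reflect which reporters populate the bright tail.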

Comparison of the relative representation of barcodes in the
top 10% sorted tRFP^hi cells from vehicle- and
TGF-β-treated MCF10A cells revealed an enrichment for
ZEB1 and ZEB2 3' UTRs in the sorted cells post-TGF-β
treatment (6.25% ZEB1 and 3.78% ZEB2 in vehicle-treated
compared to 47.06% ZEB1 and 47.06% ZEB2 in
TGF-β-treated cells) (Figure 6C). This enrichment was not
due to a transcriptional upregulation of ZEB1 and ZEB2, as
evidenced by the similar barcode abundance of ZEB1 and
ZEB2 along with the other nine 3' UTRs in the total
populations isolated from the vehicle- and TGF-β-treated
MCF10A cells (Figure 6C). Taken together, these results
showed that the pBUTR system is both scalable when
performed in an arrayed format and amenable to pooled
screening approaches via flow cytometry-based cell sorting
and barcode sequencing.
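
The enrichment reported above can be restated as a fold change per reporter. The Python sketch below uses only the four percentages quoted in the text; the function and variable names are hypothetical, and the calculation is offered as an illustration rather than as the analysis performed in this study.

```python
from math import log2

# Share of barcode reads in the sorted tRFP-high populations, as quoted in the text.
VEHICLE_PCT = {"ZEB1": 6.25, "ZEB2": 3.78}
TGFB_PCT = {"ZEB1": 47.06, "ZEB2": 47.06}


def fold_enrichment(treated_pct, vehicle_pct):
    """Ratio of a barcode's share of reads in treated versus vehicle cells."""
    if vehicle_pct <= 0:
        raise ValueError("vehicle percentage must be positive")
    return treated_pct / vehicle_pct


def enrichment_table(treated, vehicle):
    """Map each reporter to (fold enrichment, log2 fold enrichment)."""
    table = {}
    for name, pct in treated.items():
        fold = fold_enrichment(pct, vehicle[name])
        table[name] = (fold, log2(fold))
    return table


for name, (fold, lfc) in enrichment_table(TGFB_PCT, VEHICLE_PCT).items():
    print(f"{name}: {fold:.2f}-fold (log2 = {lfc:.2f})")
```

With the quoted values this corresponds to roughly 7.5-fold enrichment for the ZEB1 barcode and roughly 12.4-fold for the ZEB2 barcode in the tRFP^hi gate after TGF-β treatment.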

In vivo monitoring of post-transcriptional regulation using the 
pBUTR vector 

Cell-line-based models are valuable tools that aid in the
understanding of gene regulation events underlying cellular
physiology. However, it is often desirable to assess the
applicability of the findings derived from these models via
transgenesis in vivo, whether in the context of mammalian
development or disease. Transposon-based systems are
attractive for such transgenesis, since there is currently
little evidence that they are subject to the same silencing
mechanisms that hamper the use of retro- and lentiviral
vectors for this purpose (23-25). We thus undertook a
proof-of-concept approach to assess the potential usefulness
of the pBUTR reporter vector for revealing
post-transcriptional regulation in an in vivo setting.

The expression patterns of the ZEB1 gene product during
murine embryonic development have been described (48).
We stably transfected murine ESCs with our pBUTR vector
in which tRFP is expressed from the Ubc promoter under
the control of the wt ZEB1 3' UTR. G418-resistant ESCs
were isolated and used for blastocyst injections, and
chimeric embryos were harvested at the equivalent of 11.5
days post-coitus (E11.5) for histological analysis of reporter
expression. Sagittal sections of the E11.5 embryos revealed
relatively even expression of tGFP throughout the embryo
(Supplementary Figure S3A). Strikingly, while low levels of
tRFP fluorescence were observable throughout the embryo,
we observed markedly increased fluorescence in many
regions known to be populated by migrating mesenchymal
cells derived from the neural crest (Figure 7 and
Supplementary Figure S3). Increased tRFP fluorescence was
observed in the nasal process, in the maxillary and
mandibular arches, in the area of the trigeminal ganglion
and in the rhombic lip. Additional increases in fluorescence
were observed in the medial and lateral myotome, as well as
within non-condensed regions in the distal portion of the
limb bud (Figure 7). These areas largely overlap previously
described domains of ZEB1 expression at this embryonic
stage (48), which would be consistent with a model in which
the ZEB1 3' UTR is sufficient to confer correct
temporospatial expression of the tRFP reporter during
murine development. While our analysis is not complete
enough to draw any firm conclusions in this regard, the data
do highlight the potential of the pBUTR reporter for in vivo
analysis of 3' UTR-mediated post-transcriptional regulation.

DISCUSSION 

We have developed and tested a piggyBac-based vector for
monitoring 3' UTR-mediated post-transcriptional
regulation of gene expression in in vitro and in vivo
settings. The vector facilitates delivery of three independent
transcription units to the target genome: a selection
cassette, a constitutively expressed tGFP control reporter
and a constitutively expressed experimental reporter,
purposed here for monitoring the regulation conferred to a
tRFP reporter by distinct 3' UTRs of interest.

In the context of a stably integrated 3' UTR reporter
system, the pBUTR vector has several advantages over
competing approaches. The vector is able to deliver a
comparatively large payload to the genome of essentially
any cell type amenable to transfection, and recent advances
utilizing chimeric transposase/ZFN constructs facilitate
site-specific targeting of piggyBac-based vectors to discrete
sites within the genome (49). As compared to
retro/lentiviral delivery systems, pBUTR virtually
eliminates the hazards of exposure to potentially infectious
agents derived from packaged vector or in vitro
recombination with endogenous retroviral elements. Even
more importantly from an experimental standpoint, delivery
of completely non-overlapping control and experimental
transcription units in the context of a lentiviral or
retroviral vector poses several problems. Architectures in
which two reporters are separated by 2A peptide sequences
or internal ribosome entry sites result in shared 3' UTR
identity among the reporters, while the use of multiple
transcriptional promoters results in significant inclusion of
irrelevant and potentially confounding UTR sequence.
While use of a bidirectional promoter in the viral vector
may significantly reduce these risks, it raises the danger of
cryptic splicing or polyadenylation sites in the reverse or
antisense transcript (50).

In contrast, the larger relative cargo capacity of the 
pBUTR vector allows simultaneous delivery of multiple 
fully independent transcription units. Since transposons 
do not excise and integrate through an RNA intermedi- 
ate, chimeric splice junctions can be included within each 
transcription unit, rendering associated mRNA transcripts 
less susceptible to nonsense-mediated decay pathways (51).
Because the messages are not fed into these pathways, 
commonly employed elements promoting mRNA stabil- 
ity and translation [such as the woodchuck hepatitis virus 
posttranscriptional regulatory element/WPRE (22)] are un- 
necessary. In fact, for studies of 3' UTR-mediated post- 
transcriptional regulation, omission of stability elements is 
highly desirable since these elements might be expected to 
confound physiologically relevant regulation by the cellular 
machinery. 

Figure 7. Patterns of expression of a ZEB1 3' UTR reporter in vivo. Composite of confocal immunofluorescence images derived from sagittal section of
an E11.5 murine embryo at ×10 magnification. The embryo was derived from ESCs stably transfected with the pBUTR-ZEB1wt vector. Scale bar = 10
µm. TG, trigeminal ganglion; RL, rhombic lip; MX, maxillary arch; ML, mandibular arch; LM, lateral myotome; HL, hind limb bud; EC, endocardial
cushion. *L, liver (autofluorescence). Red: tRFP expression. Green: tGFP expression. Blue: 4',6-diamidino-2-phenylindole (DAPI) staining.

To reduce the possibility that our reporters might confound
physiologically relevant regulation conferred by a 3' UTR
of interest, for example by saturation of the cell's regulatory
machinery, we have employed two distinct ubiquitously
active transcriptional promoters (Pgk and Ubc)
characterized by a consistently low relative level of
expression (52). A potential drawback of this approach is
that the two distinct promoters might behave somewhat
differently as a function of cellular type or state. However,
in an experimental context, this risk can be largely offset by
inclusion of appropriate control reporters with minimal or
otherwise defined 3' UTR elements. Because promoter
activity and baseline reporter stability depend on cellular
state in the same way for the control and experimental
constructs, any remaining difference in expression between
them can be attributed to 3' UTR-mediated regulation. It is
however important to underscore that relative reporter
expression within this system does not differentiate between
mechanisms impacting mRNA stability and those mediating
translational repression. This distinction would have to be
addressed in downstream experimentation.
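
The control-reporter logic in this paragraph amounts to a ratio of ratios. The Python sketch below is a hypothetical illustration of that idea, not a procedure taken from this study: it assumes median tRFP and tGFP intensities are available for a reporter carrying the 3' UTR of interest and for a matched control reporter with a minimal 3' UTR, and all names and numbers are invented for the example.

```python
def normalized_ratio(trfp, tgfp):
    """tRFP signal relative to the tGFP control cassette in the same cells."""
    if tgfp <= 0:
        raise ValueError("tGFP signal must be positive")
    return trfp / tgfp


def utr_specific_effect(test, control):
    """Compare a 3' UTR reporter with a control reporter, per condition.

    Each argument maps a condition name to (tRFP, tGFP) median intensities.
    Dividing by the control reporter cancels changes that act on the
    promoters or the reporter backbone rather than on the 3' UTR.
    """
    effect = {}
    for condition, (trfp, tgfp) in test.items():
        c_trfp, c_tgfp = control[condition]
        effect[condition] = normalized_ratio(trfp, tgfp) / normalized_ratio(c_trfp, c_tgfp)
    return effect


# Hypothetical median intensities (arbitrary units), not data from this study.
utr_reporter = {"mock": (100.0, 500.0), "treated": (300.0, 500.0)}
control_reporter = {"mock": (400.0, 500.0), "treated": (600.0, 500.0)}

effect = utr_specific_effect(utr_reporter, control_reporter)
print(effect["treated"] / effect["mock"])
```

In this invented example the raw tRFP signal of the 3' UTR reporter rises three-fold on treatment, but the control reporter also rises 1.5-fold, so only the remaining two-fold change would be attributed to the 3' UTR.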

The functionalization of the pBUTR vector with
Gateway® technology was implemented in anticipation of
high-dimensionality screening and validation applications.
Since Gateway® recombineering is scalable, meaning
multiple individual 3' UTR elements can be cloned into
the vector in bulk, an inclusive, aggregate set of 3' UTRs
of interest derived from a polysomal profiling or ribosomal
protection experiment can be rapidly generated to
determine whether each or any of these 3' UTRs confers
any of the observed translational regulation. Alternatively,
comprehensive reporter libraries might be constructed to
complement RNA immunoprecipitation/sequencing-type
studies via validation of potential regulatory interactions
or prospective identification of regulation associated
with a particular physiological context. With regard to
prospective screening approaches, it should be noted that
a drawback of the pBUTR system, relative to a retro- or
lentiviral approach, is that stable transfection of cells in
bulk with a pool of vectors is not straightforward. This
would at first appear to argue against the use of the system
in pooled screening approaches. However, provided that
the initial transfection and selection are performed in an
arrayed format, the inclusion of unique barcode elements
with the minimal polyadenylation signal allows analysis of
enrichment or depletion within pooled cell populations via
flow cytometry and cell sorting.

Finally, while the proof-of-concept experiments described
in this study make use of pBUTR as a tRFP 3' UTR
reporter, the system is also directly adaptable to modeling
the impact of single nucleotide polymorphisms or other
mutations within a given 3' UTR on the function of
associated genes in in vitro or in vivo model systems. This
application may be useful for rapid analysis of candidate
phenotypic drivers derived from genome-wide association
or other large-scale screening efforts aimed at the
identification of sequence variants contributing to human
disease.